A comparison of approaches to account for uncertainty in analysis of imputed genotypes.

نویسندگان

  • Jin Zheng
  • Yun Li
  • Gonçalo R Abecasis
  • Paul Scheet
چکیده

The availability of extensively genotyped reference samples, such as "The HapMap" and 1,000 Genomes Project reference panels, together with advances in statistical methodology, have allowed for the imputation of genotypes at single nucleotide polymorphism (SNP) markers that are untyped in a cohort or case-control study. These imputation procedures facilitate the interpretation and meta-analyses of genome-wide association studies. A natural question when implementing these procedures concerns how best to take into account uncertainty in imputed genotypes. Here we compare the performance of the following three strategies: least-squares regression on the "best-guess" imputed genotype; regression on the expected genotype score or "dosage"; and mixture regression models that more fully incorporate posterior probabilities of genotypes at untyped SNPs. Using simulation, we considered a range of sample sizes, minor allele frequencies, and imputation accuracies to compare the performance of the different methods under various genetic models. The mixture models performed the best in the setting of a large genetic effect and low imputation accuracies. However, for most realistic settings, we find that regressing the phenotype on the estimated allelic or genotypic dosage provides an attractive compromise between accuracy and computational tractability.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Selection of Variables that Influence Drug Injection in Prison: Comparison of Methods with Multiple Imputed Data Sets

Background: Prisoners, compared to the general population, are at greater risk of infection. Drug injection is the main route of HIV transmission, in particular in Iran. What would be of interest is to determine variables that govern drug injection among prisoners. However, one of the issues that challenge model building is incomplete national data sets. In this paper, we addressed the process ...

متن کامل

Complex Risk Analysis of Investing in Agriculture ETFs

The aim of the paper is to present a complex risk analysis of investing in agriculture Exchange Trade Funds (ETFs). The specific characteristics of agricultural investments should be taken into account as from the direct financial investments into agricultural ETFs, as for the general portfolio approach applying. To achieve the objectives of the work, the authors structured agriculture ETFs int...

متن کامل

تنوع ژنتیکی ژنوتیپ های لوبیا (L. vulgaris Phaseolus) در شرایط تنش خشکی

To evaluate genetic diversity and to determine the relationships between yield and other importante traits among bean genotypes, an experiment was conducted in Random Completely Block Design (RCBD) with three repetitions under both normal and drought stress conditions in 2015-2016 crop season on 30 bean genotypes  at Tehran University research farm. The results of variance analysis indicated hi...

متن کامل

ارزیابی صحت پیش‌بینی ژنومی در معماری‌های مختلف ژنومی صفات کمی و آستانه‌ای با جانهی داده‌های ژنومی شبیه‌سازی‌شده، توسط روش جنگل تصادفی

Genomic selection is a promising challenge for discovering genetic variants influencing quantitative and threshold traits for improving the genetic gain and accuracy of genomic prediction in animal breeding. Since a proportion of genotypes are generally uncalled, therefore, prediction of genomic accuracy requires imputation of missing genotypes. The objectives of this study were (1) to quantify...

متن کامل

Microsatellite Analysis for Differentiation and Identification of the Source Tree of Fagus orientalis Lipsky

The present study describes approaches for the identification of individual beech trees using maternal tissues from their seeds or fruits. Four microsatellite markers were used for genetic analysis of seedlots from Fagus orientalis Lipsky, a highly out-crossing tree species. Seeds from 11 single-tree harvests belonging to one population, (7 seeds from each), as well as non-paranchymatic materna...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Genetic epidemiology

دوره 35 2  شماره 

صفحات  -

تاریخ انتشار 2011